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Mindfulness programs for schools are popular. We systematically reviewed the evidence 
regarding the effects of school-based mindfulness interventions on psychological 
outcomes, using a comprehensive search strategy designed to locate both published and 
unpublished studies. Systematic searches in 12 databases were performed in August 
2012. Further studies were identified via hand search and contact with experts. Two 
reviewers independently extracted the data, also selecting information about intervention 
programs (elements, structure etc.), feasibility, and acceptance. Twenty-four studies were 
identified, of which 13 were published. Nineteen studies used a controlled design. In total, 
1348 students were instructed in mindfulness, with 876 serving as controls, ranging from 
grade 1 to 12. Overall effect sizes were Hedge's g = 0.40 between groups and g= 0.41 
within groups (p < 0.0001). Between group effect sizes for domains were: cognitive 
performance g = 0.80, stress g = 0.39, resilience g = 0.36, (all p < 0.05), emotional 
problems g = 0.19 third person ratings g = 0.25 (both n.s.). All in all, mindfulness-based 
interventions in children and youths hold promise, particularly in relation to improving 
cognitive performance and resilience to stress. However, the diversity of study samples, 
variety in implementation and exercises, and wide range of instruments used require a 
careful and differentiated examination of data. There is great heterogeneity, many studies 
are underpowered, and measuring effects of Mindfulness in this setting is challenging. 
The field is nascent and recommendations will be provided as to how interventions and 
research of these interventions may proceed. 
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INTRODUCTION AND BACKGROUND 

The application of Mindfulness-Based Interventions (MBIs) has 
become increasingly popular in the last few years, both in research 
and practice. Mindfulness can be defined as the psychological 
capacity to stay willfully present with one's experiences, with a 
non-judgemental or accepting attitude, engendering a warm and 
friendly openness and curiosity (Kabat-Zinn, 2005). 

Originally derived from eastern traditions and Buddhist psy- 
chology, mindfulness can be cultivated by various techniques 
(Bankart, 2003; Wallace and Shapiro, 2006). Formally, it is trained 
by meditation practices such as sitting meditation, or physical 
movement such as yoga or tai chi. These techniques help steady 
the mind and train its attentional capacity, while also increas- 
ing its breadth of focus. Practitioners are instructed to focus their 
attention on the present moment using an "anchor," for instance, 
the breath. When the mind drifts away, the focus is gently brought 
back to the present moment experience. The practitioner tries 
to simply observe his or her experience of the present moment 
without judging or modifying it. 

Roughly 30 years ago, Jon Kabat-Zinn introduced mindful- 
ness as a resource into clinical research and practice through 
the Mindfulness-Based Stress Reduction Program (MBSR). The 
MBSR program consists of 8 weekly sessions of 2 1 /2h, and 
a day of mindfulness. Mindfulness is practiced formally in 



sitting meditation, by simple yoga movements, and in the body- 
scan, which is a gradual sweeping of attention through the 
body. Mindfulness is also cultivated in daily activities such as 
eating, and by using it as a resource in emotionally challeng- 
ing situations or in dealing with physical pain. The recom- 
mended daily home practice lasts approximately 45 min, and 
includes formal and informal exercises. Moreover, the program 
includes psycho-education, and attitudes such as not judging, a 
beginner's mind, trust, non-striving, acceptance, letting go, and 
patience are encompassed (Kabat-Zinn, 1982, 1990, 2003). The 
MBSR program became the parent to several variations, such as 
Mindfulness-Based Cognitive Therapy (MBCT; Segal et al., 2002), 
initially developed for preventing relapse of depression. In other 
cognitive-behavioral therapies, such as acceptance and commit- 
ment therapy, (ACT; Hayes et al., 1999) and dialectical behavior 
therapy (DBT; Linehan, 1993), the emphasis of treatment lies on 
acceptance as well as on change. 

In several reviews and meta-analyses, MBIs proved to be effec- 
tive in a wide range of stress related and clinical problems and dis- 
orders for various disease groups (Grossman et al., 2004; Fjorback 
et al, 2011; Piet and Hougaard, 2011; Piet et al., 2012). In 
addition, an interesting aspect of MBIs is their potential preven- 
tive and health promoting capacity in non-clinical populations: 
reducing stress, increasing well-being and strengthening immune 
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functions (Davidson et al., 2003; Chiesa and Serretti, 2009; Eberth 
and Sedlmeier, 2012); promoting personal development such as 
self-compassion, empathy and perspective taking (Shapiro et al., 
1998, 2007; Birnie et al., 2010); increasing attentional capacity 
(Jha et al., 2007; Tang et al, 2007) and the temporal window of 
attention (Sauer et al, 2012). 

One potential mechanism could be through decreasing the 
tendency to avoid unwanted experiences, thus generally improv- 
ing positive affect (Sauer et al., 2011a,b). Mindfulness seems to 
be the opposite of mind-wandering (Smallwood and Schooler, 
2006). Mind- wandering has been linked to the activity of the 
default-mode network (DMN), i.e., those areas of the brain that 
become active when the cognitive system remains idle (Raichle 
et al., 2001). Interestingly, experienced Zen meditators show 
reduced baseline activity of the DMN (Pagnoni et al., 2008). 
Since a higher activity of the DMN is related to increased neg- 
ative affect and to the rate of mistakes in attentional and other 
tasks (Smallwood et al., 2011), it seems natural that reducing 
mind-wandering and improving attentional capacities could be 
beneficial in many respects, and might be one of the generic 
mechanisms through which mindfulness-based approaches work 
(Carmody, 2009). 

Given the diverse usefulness and beneficial record of MBIs for 
adults, researchers and clinicians are striving to develop adap- 
tations for children and youths. Research is in its infancy, but 
initial reviews suggest that MBIs are feasible with children and 
adolescents and seem to be beneficial in both clinical and non- 
clinical samples (Black et al., 2009; Burke, 2009). They have been 
successfully applied to adolescents with attention deficit hyper- 
activity disorder (ADHD) symptoms (Van der Oord et al, 2012; 
Weijer-Bergsma et al., 2012), and to adolescents with a vari- 
ety of externalizing disorders (Bogels et al, 2008). MBIs lead 
to a reduction in symptoms of depression in minority children 
(Liehr and Diaz, 2010) and to a reduction in anxiety and increase 
of social skills in students with learning disorders (Beauchemin 
et al, 2008). In a study of "at-risk" and HIV-positive youth, 
decreases in hostility and general and emotional discomfort have 
been reported, while qualitative data indicated improvements in 
academic performance, interpersonal relations, stress-reduction, 
and physical health (Sibinga et al., 2011). Also, first conceptual 
frameworks have been created as to why MBI's are beneficial for 
children and youth and how mechanisms might work (Mind and 
Life Education Research Network (MLERN), 2012; Zelazo and 
Lyons, 2012). 

School appears to be an appropriate setting for such interven- 
tions, since children spend a lot of time there and interventions 
can be brought directly to groups of children in areas of need 
as part of a preventive approach at little cost (Weare and Nind, 
2011). Mindfulness can be understood as the foundation and 
basic pre-condition for education. Children need to learn to 
stop their mind wandering and regulate attention and emo- 
tions, to deal with feelings of frustration, and to self-motivate. 
Mindfulness practice enhances the very qualities and goals of 
education in the 21st century. These qualities include not only 
attentional and emotional self-regulation, but also prosocial dis- 
positions such as empathy and compassion, self-representations, 
ethical sensitivity, creativity, and problem solving skills. They 



enable children to deal with future challenges of the rapidly 
changing world, ideally becoming smart, caring, and committed 
citizens (Shapiro et al., 2008; Mind and Life Education Research 
Network (MLERN), 2012). 

Concurrently, reports of increasing clinical problems in chil- 
dren, stress-related problems and problems related to social pres- 
sure in and outside school are worrying. Children and youth 
frequently experience stress in school (Currie et al., 2002; Lohaus 
and Ball, 2006; Card and Hodges, 2008), which has an impact 
on the brain structures involved in cognition and mental-health 
(Lupien et al, 2009). Serious mental disorders are also widespread 
among children. It has been reported that 21% of the 13 to 18 year 
olds in the US are currently suffering, or have at some point dur- 
ing their life suffered, from a severe mental disorder (Merikangas 
et al., 2010), with ADHD, behavioral or conduct problems, anx- 
iety, and depression being the most prevalent current diagnoses 
(US Department of Health and Human Services, and Centers for 
Disease Control and Prevention, 2013). 

Formal education should always consider the mental health 
and balance of children. A growing body of research shows that 
"academic achievement, social and emotional competence and 
physical and mental health are fundamentally and multiply inter- 
related. The best and most efficient way to foster any of those is 
to foster all of them" (Diamond, 2010, p. 789). Schools are there- 
fore confronted with the task of not only being institutions for 
formal education, but also a place that provides tools for prevent- 
ing disorders and fostering personal development and well-being 
in children. These needs have driven educators, teachers, and psy- 
chologists to seek methods to improve school-based learning and 
the social experience connected with it. MBIs in schools are seen 
as an approach to tackle these challenges, because prevention 
and education can be provided simultaneously, addressing a wide 
range of needs and unfulfilled potentials of students. 

As a result, various mindfulness programs for schools have 
been developed and applied within the past few years (see 
Meiklejohn et al., 2012 for an overview). Several research insti- 
tutes and associations, such as the Garrison Institute, are initi- 
ating workshops and conferences on Mindfulness in Education 
on a regular basis. Within mailing lists administrated by the 
Mindfulness in Education Network (www.mindfuled.org) or the 
Association of Mindfulness in Education (www.mindfuleducation. 
org), clinicians, educators, and researchers from all over the world 
share ideas, material and experiences of mindfulness in schools. 
The increasing amount of meetings, books, and newspaper arti- 
cles indicate that the integration of mindfulness into education is 
received with great interest and is seen as a potentially plausible, 
cost-effective, and promising approach. 

The number of studies evaluating MBI's in school settings is 
also growing. However, others point out that, to date, enthusi- 
asm about the integration of MBI's in schools surpasses evidence 
(Greenberg and Harris, 2011). The diversity of programmes and 
outcome measures combined with the pilot-character of most 
studies make it difficult to get a general impression of effective- 
ness, and directions of further research cannot be easily derived. 
Presenting a narrative review on the literature, Meiklejohn et al. 
(2012) made a good start summarizing the research published to 
date, but a quantitative synthesis exclusively integrating studies 
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on MBI's in school context is still lacking. Specifically, it would be 
helpful to know if there are specific domains in which MBI's are 
particularly beneficial. At this point the inclusion of unpublished 
literature, such as doctoral theses, would enrich the discussion, 
as these often contain supplementary information that could 
be valuable and could introduce new approaches to this spe- 
cific research field, such as, for example, the choice of measures. 
Also, little is known about the feasibility of integrating MBI's 
into school-routine, for example, the acceptability of different 
programme elements. 

To help progress this field of research, we decided to carry out 
a meta-analytic review. Aiming to give a complete insight into the 
actual state of the art, we adopted a very open and comprehen- 
sive stance by locating as many studies as possible, both published 
and unpublished, and by including all relevant material. First, we 
addressed the types of mindfulness interventions that have been 
applied and the measures used in order to provide a transparent 
overview of the field. Second, we explored how MBI's work in 
a school setting: collecting findings on feasibility and acceptabil- 
ity. With a view to provide recommendations for future research, 
third, we ascertained the quality of the existing trials and iden- 
tified possible methodological challenges. Fourth, we carried out 
a quantitative synthesis in order to ascertain whether effect sizes 
warrant pursuing this line of research further. By also deriving 
domain-specific effect sizes, we aimed to clarify the diversity of 
outcome measures and to address the issue of which domains 
might be most beneficial for school children. 

Since the work was exploratory, it was intended to give orien- 
tation and develop further hypotheses rather than to test them. In 
the following, we present a systematic review of the literature and 
a meta-analysis of the available information. 

METHODS 
SEARCH STRATEGY 

A comprehensive search strategy was chosen in order to 
locate both published and unpublished studies. In August 
2012 systematic searches were performed in 12 databases 
and catalogs including Web of Knowledge, SciVerse Hub, 
PsychARTICLES, PSYNDEX, Psychology and Behavioral Sciences 
Collection, ERIC, FIS, The DART- Europe E-Theses Portal, PDQT 
Open, DissOnline, Openthesis, and UMI Dissertation Express. 
Mindfulness_ was used as the key word, combined with School_, 
Classroom_, or Education_, where appropriate. Studies were 
searched from the first year the database was available and no 
language restrictions were applied. 

After removal of duplicates and screening abstracts of the 
remaining studies, full-text articles of relevant studies were 
retrieved for examination. The reference lists of the selected 
articles were inspected and authors of relevant studies were 
contacted. Emails were sent to the mailing list of Mindfulness 
in Education Network and the Association of Mindfulness in 
Education in October 2012. All volumes of the Mindfulness 
Research Monthly Newsletter and Mindfulness Journal were 
screened up to and including October 2012. 

The first two authors independently extracted the data 
from the original reports in order to decide on inclusion. 
Disagreements were solved by discussion. 



INCLUSION CRITERIA 

Studies were selected if the following criteria were met: 

( 1 ) Interventions were mindfulness-based. 

(2) Implementation took place in a school-setting. 

(3) Participants were pupils or students from grade 1 to 12. 

(4) Outcomes were quantitative data, referring to psychological 
aspects. 

We sought interventions based on the concept of mindfulness, 
with classical mindfulness practices such as mindful breathing 
or the body scan as core elements. Combinations with other 
methods, such as massage, imaginary journey, or games, were 
accepted as long as their implementation was aimed at cul- 
tivating mindfulness, making it easily accessible for the tar- 
get age-group and setting. Approaches combining mindfulness 
and other established techniques such as Autogenic Training or 
Progressive Muscle Relaxation were excluded, because outcomes 
cannot clearly be attributed to mindfulness. For the same reason 
evaluations of trainings mainly based on concentrative medita- 
tion, such as Transcendental Meditation, were also excluded. No 
further methodological exclusion criteria were applied. 

DATA EXTRACTION 

Data on methodology and outcomes of included studies were 
extracted and coded by the first author and checked by the sec- 
ond author. These data covered information on schools and 
participants, sample size and study design, applied measures, 
type of statistical analysis and major findings reported, as well 
as data necessary for calculating effect sizes. Relevant informa- 
tion concerning interventions and feasibility was extracted by the 
second author and checked by the first author. This information 
included setting, structure, and elements of intervention and var- 
ious aspects of feasibility (e.g., acceptability, fidelity, attrition). In 
cases where important information was missing, study authors 
were contacted. 

STATISTICAL METHODS 

The weighted mean effect size (ES) g was chosen as a statistic for 
final analysis. Hedges's g is a variation of Cohen's d (Cohen, 1988), 
standardizing the mean difference by a pooled standard deviation 
using n-l for each sample (Hedges and Olkin, 1985). 

Mi -Mi I (fli- 1)4 + {n 2 -1)4 m 

ghedges= With S poo i ed = W 1 z (1) 

Spooled V «1 + «2 - 2 

ESs were then multiplied with c(m), a correction factor to correct 
potential bias due to small sample sizes. 



where m refers to degrees of freedom used to estimated Spooled 
(Hedges, 1981). Hedges's g can be interpreted according to 
Cohen's ES conventions (1988) as small (0.2), medium (0.5), and 
large (0.8). 
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Within-group ES were calculated for all relevant measures in 
every study. For controlled trials ES of baseline equivalence and 
differences in change scores were also derived. 

In several cases means and standard deviations were not 
reported. If statistics like partial eta-squared (interpreted as r 2 ), 
t- or P-values were given, g could be derived according to spe- 
cific formulas. In other cases, all essential data were missing and 
authors did not provide them after being contacted. In order to 
prevent bias due to missing data, ES were estimated in alterna- 
tive ways (marked with a #). Lacking means, for example, could 
be derived from graphs (8, 14). Missing SDs for within-group 
differences were estimated by deriving standard error of change 
score differences (8), or were derived from SD of within-group 
differences, assuming that population variance at time 1 and 2 
was equal (18). In another study, standard deviations of the norm 
sample were used for ES calculation (22). If no information was 
neither reported nor could be extracted, results were suggested 
to be insignificant and thus ES were estimated as 0 (Rosenthal, 
1995). This was done for study no. 8, 12, 18, and 22 (see 
Table 1). 

Two kinds of overall ESs were estimated. First, a within-group 
effect size was derived, based on the average of pre-post changes of 
intervention group in every study. Second, a controlled between- 
group effect size was calculated for all controlled trials. It was 
based on average change score differences between interven- 
tion group and control. A change score comparison was chosen 
instead of a simple post-test comparison, because baseline equiv- 
alence could not be assumed for all studies, and this might bias 
the estimation of intervention effects. 

Standard errors of within group and controlled effect sizes 
were calculated according to the following formulas: 



SE 



within group 



n + 2(n - 1) 



and 



SE, 



controlled 



«1 + «2 



+ 



nif?2 2(«i + JI2) 



(3) 



Initially, we grouped ES into four domains which had been shown 
to be affected by mindfulness practice in adults according to 
measurement method and construct: perceived stress and cop- 
ing (S), factors of resilience (R), and emotional problems (E) 
were measured via self-report scales. A domain of cognitive per- 
formance (C) was measured by performance tests. Subsequently, 
given that a lot of studies used questionnaires for parents and 
teachers addressing various domains, we created a fifth domain 
containing third person ratings (T) exclusively. Independence of 
results was ensured for all analysis. Where a study contributed 
several ES to the same domain, ES were averaged. 

Reliability of measures could not be used to adjust effect-sizes, 
as authors did not consistently report reliability and the measures 
that were reported were not compatible with each other. 

The inverse variance random-effects model (DerSimonian and 
Laird, 1986) was chosen to carry out quantitative synthesis. This 
model incorporates an assumption that the population parame- 
ters vary from study to study. As a consequence, variation in effect 



sizes are not only caused by sampling error, but also occur due 
to differences between hyperparameter and population parameter 
values. Thus, results can be generalized beyond the included stud- 
ies. The between-study variance tau-squared (t 2 ) is the estimated 
standard deviation of underlying effects across studies. 

Heterogeneity between studies was assessed via the Q and the 
I 2 statistic. The Q-test determines the probability of sampling 
errors being the only cause for variance. Under the hypothesis 
of homogeneity among effect sizes, the Q statistic follows the 
chi-square distribution. As a result, significant Q-values can be 
considered as evidence for heterogeneity because variance is also 
due to differences between effect sizes. The I 2 index describes the 
percentage of the variability in effect estimates that is caused by 
heterogeneity. I 2 of around 25, 50, and 75% would be interpreted 
as low, medium, and high heterogeneity. To identify publication 
bias a funnel plot was used. A funnel plot is a scattergram where 
the ES is plotted at the horizontal axis and the study size is plot- 
ted on the vertical axis. With no availability bias, one should 
see a funnel turned upside down. In case of bias, when smaller 
studies without significant effects were not available, the scat- 
tergram should deviate noticeably from the symmetrical funnel 
shape. Additionally we used the fail-safe N as a rough measure 
of the robustness of our analysis against availability bias. The 
fail-safe number (kf s ) estimates the number of unavailable null 
result studies that would be required to render the overall p level 
of the meta-analysis insignificant. If the fail-safe number is large 
(larger than 5k + 10), essential influence of bias on mean effects 
of meta-analysis are unlikely (Rosenthal, 1991). 

FEASIBILITY 

When a new intervention has just been implemented, informa- 
tion on feasibility of the process is a rich source for improvement, 
refinement, and adaptation of the intervention at later stages. The 
term feasibility here is understood as assessing the applicability 
of the different programs, their strengths, and weaknesses. For 
this analysis of the data we assumed two different areas of focus 
(Bowen et al., 2010): (1). Acceptability: to what extent the program 
is judged as suitable, satisfying, or attractive to program deliv- 
erers (teachers) and recipients (students). (2). Implementation: 
to what extent the program is successfully delivered to intended 
participants in the context of daily school-routine. 

RESULTS 
TRIAL FLOW 

In Figure 1, the study selection process is visualized in a PRISMA 
flow diagram (Moher et al., 2009). The initial search provided 207 
possibly relevant records after duplicates were removed. One hun- 
dred and sixty-five records were excluded after screening, mostly 
because they were reports or conceptual papers rather than 
experimental or scientific studies. Further screening of 42 full 
manuscripts against inclusion criteria identified 24 studies. The 
most prevalent reasons for exclusion at this stage were that the 
intervention could not clearly be defined as solely mindfulness- 
based (.K" = 9 ) , but was combined with relaxation techniques such 
as Progressive Muscle Relaxation, visualization, or bio-feedback. 
Further, three studies were excluded because the intervention was 
implemented in a setting other than regular school life, such as 
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Records identified through 
database searching (K= 
193) 



Records identified through 
other sources [K= 47) 



Records after duplicates removed (K= 207) 



, i 

Records scre< 


1 . 

ined (K= 207) 




■ 


42 full-text arcticles assessed for 
eligibility 


1 


24 studies indue 
synt 


led in qualitative 
hesis 






24 of studies included in 
quantitative synthesis 
(meta-analysis) 



165 records excluded 

- Reports about experience 

- Theoretical and conceptual papers 

- Case studies 



18 full-text articles excluded 

- Intervention not only mindfulness- 
based [K=9] 

- No school setting (K= 3) 

- Inadequate methods (K = 4) 

- Full-text not available (K= 2) 



FIGURE 1 | Flow of information from identification to inclusion of studies. 



a summer camp for example. Finally, four studies did not meet 
methodical criteria as they used an ideographic approach (K = 2) 
or were case studies (K = 2). Authors of two unpublished studies 
which had been identified as potentially relevant in the second 
screening did not provide the full-text article or data (K = 1), 
or could not be reached (K = 1). Qualitative and quantitative 
syntheses are based on all 24 studies. 

GENERAL STUDY CHARACTERISTICS 

Study characteristics are outlined in Table 1. Of the 24 stud- 
ies that had been located, 13 were published in a peer- reviewed 
journal, and three were in press. Unpublished studies comprised 
manuscripts published on the internet (K = 2), unpublished 
data (K = 1), or Master's (K = 2) and PhD dissertation theses 
(K = 3). The earliest study was published in 2005. Fourteen stud- 
ies were carried out in North America, seven in Europe, one in 
Australia, and two in Asia. In total, 1348 students were instructed 
in Mindfulness, and 876 served as the comparison group, ranging 



from grade 1-12, reflecting age 6 to 19. Sample sizes of studies 
varied between 12 and 216. Studies differed greatly in how they 
described the setting, intervention, and sample. 

In eight studies, mindfulness training was implemented at ele- 
mentary school level (grade 1-5), in two studies at middle school 
level (grade 6-8), and in 14 studies at high school level (grade 
9-12). In one study, mindfulness was introduced to students from 
grade 7-12. In most studies, description of school, neighborhood, 
or participants was very limited. There was a wide variety of 
school types, including mostly public schools (urban and sub- 
urban), a private residential school, a catholic school for girls, a 
fee-paying boys' school, a rural high school, and a public alterna- 
tive high school. Where sample characteristics were mentioned, 
samples were mostly of low socio-economic status and students 
were described as low performing or "at risk." However, it is 
very probable that other samples might be from higher socio- 
economic backgrounds, which would result in a diverse range of 
sample characteristics (see Table 1). 
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INTERVENTIONS 

The programs of this database have been reviewed and rated 
into different domains according to underlying theory, objectives, 
components, and intensity. If an intervention is to be evaluated 
in terms of effectiveness, it is necessary that details of the pro- 
gram, such as the theoretical base, well defined goals, explicit 
guidelines, training, and quality control, are described (Weare 
and Nind, 2011) and steps of implementation are carefully docu- 
mented (Durlak and DuPre, 2008). Not all of the studies offered 
sufficient information on program details or implementation, 
and some additional work was necessary to gather sufficient infor- 
mation. This part of the analysis will be reported in another article 
(Herrnleben-Kurz et al., in preparation). Here we summarize 
basic details about interventions and programs. 

As can be seen in Table 2, the theoretical framework of the 
programs refers to the concept of mindfulness. In most cases the- 
ory is linked to previously existing mindfulness programs, such 
as MBSR, MBCT, DBT, and ACT. Some interventions also make 
reference to theories and findings from positive psychology, or 
combine MBI with a special group of school-based intervention 
programs, such as social and emotional learning (SEL). 

Manualized programs, such as MindfulSchools or Learning to 
BREATHE, were identified in two thirds of the studies. These 
programs were generally available but only two had an endur- 
ing presence of more than five years, and many did not contain 
sufficient guidance material for implementation. Others were 
reported to be manualized, but the material was not made avail- 
able (see Table 2). The programs themselves often define similar 



Table 2 | General features of MBI's applied. 


General features 


K 


% 




THEORETICAL FRAMEWORK 


Mindfulness 


24 


100 


Positive psychology (including SEL) 


9 


38 


Executive function 


6 


25 


USE OF PROGRAM MANUAL 


Existing since > 5 years (<2007) 


2 


8 


Existing since < 5 years 


13 


54 


Ad-hoc program 


9 


38 


INTERVENTION FEATURES 


Class by teacher 


7 


29 


Class by non-school trainer 


15 


63 


Class by teacher and non-school trainer 


2 


8 


INTERVENTION COMPONENTS 


Breath awareness 


24 


100 


Working with thoughts and emotions 


21 


88 


Psycho-education 


20 


83 


Awareness of senses and practices of daily life 


20 


83 


Group discussion 


18 


75 


Body-scan 


14 


58 


Home practice 


12 


50 


Kindness practices 


11 


46 


Body-practices like yoga 


6 


25 


Mindful movement other body-practices) 


5 


21 


Additional material 


10 


42 



objectives. These are mostly related to the assessment methods and 
mirrored in the domains which have been identified (see outcome 
methods below). 

Most programs contain more than one component to facilitate 
mindfulness, with observation of breath as the traditional essen- 
tial exercise, as well as psycho-education and group discussions 
(see Table 2). 

Predominantly, MBIs were conducted by professional trainers, 
most of whom were involved as study authors. Few interventions 
had been instructed by the class teachers, and not all had per- 
sonal experiences with mindfulness practices. Some had briefly 
been introduced to the topic, while others had undergone a MBSR 
course before implementation. 

The periods and intensity (frequency and length) of training 
varied from 4 weeks to 24 weeks with a median of 8 weeks, with 
45 min once a week in most programs. Some programs split this 
over several sessions per week. In total, interventions varied from 
160 to 3700 min of practice, with a median of 420 min. 

STUDY QUALITY ASSESSMENT 

As can be seen in Table 1, 19 of the 24 studies used a controlled 
design and five used a pre-post design. Randomized designs 
were realized in studies where mindfulness training was offered 
as an alternative or extracurricular activity at school {K = 10). 
Students who signed up for the mindfulness training were ran- 
domly allocated to either a mindfulness or control group. In 
one study, a group of students with matched backgrounds was 
invited to function as control. In quasi-experimental designs, 
mindfulness was taught in a classroom setting and another class, 
mostly the parallel class, served as control {K = 8). In another 
study (Study 17, Table 1) a reading training of the same inten- 
sity as the MBI took place. Selection and allocation of classes to 
interventions was mainly decided upon by the heads and class- 
room teachers. In four studies, classes or schools were randomly 
assigned to conditions. Follow up measures were collected in five 
studies. 

For every effect size we performed a post-hoc power analysis 
using the software program G*Power (Faul et al., 2009). Given an 
alpha of 0.05 (one-sided), and a power of 80%, a sample size of 
n = 41 was determined for pre-post ES to detect an effect of d = 
0.40. Twelve studies met this criterion. The same procedure for 
controlled ES revealed a sample size of n = 78 per group, which 
was achieved in three controlled studies. 

Fifteen studies reported data on attrition in the intervention 
group, in which rates varied between 0% (23) and around 40% 
(1, 19), either due to invalid or incomplete data (7, 10, 11, 12, 13, 
17, 23), or because students did not fulfill a defined amount of 
attendance or home practice (1, 5, 6, 8, 19). Eight studies speci- 
fied reasons for withdrawal, mostly naming scheduling conflicts, 
school transfers, or school absence. Two studies reported drop- 
outs due to parental refusal (12, 16) and in one case five students 
decided to leave the training after the first session (19). 

OUTCOME MEASURES 

A variety of measures were applied to investigate the effects of 
mindfulness training. We grouped the outcomes into the domains 
as follows: 
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Cognitive performance (C) 

Nine measures in total were classified in the domain of cog- 
nitive performance. In most cases, cognitive performance was 
quantified by attention tests (Studies 8, 12, 13, 17, 22, Table 1). 
A creativity test (3) was used in one study, and in another (13) the 
mind wandering paradigm was applied. Two studies (4, 6) used 
grades as dependent variables. 

Emotional problems (E) 

In the domain of emotional problems self-report questionnaires 
focusing on maladaptive emotion, cognition, and behavior are 
summarized, also including clinical symptoms, such as anxiety 
and depression (4, 7, 9, 12, 19, 21, 23), test anxiety (8, 12), somatic 
reactions (11, 16), ruminative thinking style (11) emotion 
regulation difficulties (6, 9, 11, 16), and various difficulties (23). 

Stress and coping (S) 

Nine Studies investigated changes of perceived stress and coping 
behavior via self-report questionnaires (7, 9, 10, 13, 16, 17, 19, 
20). In one study (12) Cortisol measures in combination with a 
stress test (math quiz) were carried out. These outcomes were 
examined separately. 

Resilience (Rj 

Seventeen studies collected self-report data on constructs we cat- 
egorized as factors of resilience: well-being (13, 14, 15, 17, 19, 20), 
positive and constructive emotions or affect (7, 11, 12, 13, 16, 18, 
22), resiliency (14, 15), social skills and positive relationships (7, 

21, 22, 23), self-concept and self-esteem (4, 5, 10, 18). 

Third person ratings (T) 

In the domain of third person ratings, parent and teacher ques- 
tionnaires were grouped, dealing with aspects such as aggressive 
or oppositional behavior, social skills, emotional competence, 
well-being, attention, and self-regulation (1, 2, 6, 8, 13, 18, 21, 

22, 24). 

Another study measured school attendance (6). Since this 
measure does not fit any of the domains, it was not included in the 
domain-specific analyses. The numerical proportions of measures 
applied in studies are portrayed in Figure 2. 

FEASIBILITY 

Only some of the studies offered information about how the 
integration of the program into school-routine was working. In 
some studies, one or more aspects of feasibility were assessed sys- 
tematically via questionnaires, focus groups, or interviews. Some 
reported a systematic assessment, but did not provide a report 
or an analysis of respective data. Others reported only anecdotal 
evidence. 

ACCEPTABILITY 

One third of studies provide information about acceptability. 
There seems to be an overall high acceptability in those studies 
referring to students and teachers, but, again, methods were partly 
heterogeneous and unsystematic. 

Results of interviews and focus groups (teachers and students) 
indicate a uniformly positive experience of the intervention 




FIGURE 2 | Numerical proportions of measures applied in studies. 



(Beauchemin et al., 2008; Mendelson et al, 2010; Lau and Hue, 
2011). Eighty-nine per cent of the students would recommend the 
training to others (Broderick and Metz, 2009; Metz et al, 2013). 
In Anand and Sharma's study (in press) 81% of the students rated 
the program sessions as extremely useful, and 83% as satisfying. 

Three quarters of the students said that they would like to con- 
tinue, and thought that it could have lasted longer (Beauchemin 
et al., 2008; Huppert and Johnson, 2010), or that it was the right 
length (Anand and Sharma, in press). Only 5% thought that the 
intervention was too long (Huppert and Johnson, 2010). Potek 
(2012) cited a noteworthy statement: "We just started getting it. I 
think we should have more time to practice." 

Some of the programs also contain an individual home prac- 
tice: Huppert and Johnson (2010) found that one third practiced 
at least three times a week and two thirds once a week or less. In 
Broderick and Metz's study (2009), two thirds of the participants 
practiced mindfulness techniques outside the classroom. By ana- 
lyzing the protocols, Frenkel et al. (in press) found that no one 
practiced the full amount of weekly exercises and two thirds failed 
to do their homework at least once. 

IMPLEMENTATION 

Joyce et al. (2010) mentioned specific factors which facili- 
tated successful implementation: teaching along with colleagues, 
administrative and parental support, or children's enthusiasm. 
What hindered was a lack of time and students who failed to 
engage with the program. In the study of Beauchemin et al. 
(2008), teachers suggested that the intervention was feasible 
when conducted in a classroom with voluntary participation. 
Desmond and Hanich (2010) mentioned problems regarding 
scheduling, completion of administration, beginning of holidays, 
and difficulties with participants arriving too late. Some studies 
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provided information about feasibility of different program- 
elements, and very few reported implementation integrity which 
had been assessed via protocols, detailed scripts, feedback for- 
mulas, or fidelity logs. Because these data were rare we did not 
include them in the analysis of outcomes. 

QUANTITATIVE SYNTHESIS 

Within-group effect size 

The results of the quantitative synthesis are reported in Table 3. 
Weighted mean effect sizes for within-group effect sizes was 



g = 0.41 (95% CI 0.28-0.54), which can be considered as a 
small to medium effect. The Q statistic indicates heterogene- 
ity, and the 12 index shows that a large amount of variance is 
caused by it. The fail-safe number exceeded the criterion. Figure 3 
shows a funnel plot of the respective 24 effect sizes where the 
vertical bar marks the weighted mean effect size. Asymmetry 
can be seen: Studies with small sample sizes and small or even 
negative effects are lacking. Only a few studies, with rather 
small sample sizes, are located above the estimated mean effect 
size. Sensitivity analyses, excluding the five studies with partly 



Table 3 | Overall within-group and controlled effect sizes and respective subgroup effect sizes, including effect size statistics. 



Type of effect size 


Sample 




Effect size 




T 2 




Homogeneity 




fe fs a 


Criterion 11 


K 


n 


Hedges's g 


95% -CI 


P 


Q 


P 


I 2 


Within-group effect 


24 


1348 


0.41 


(0.28, 0.55) 


<0.00001 


0.08 


112.52 


<0.00001 


80% 


1008 


130 


Excluding estimated ES (#) 


19 


917 


0.49 


(0.31, 0.67) 


<0.00001 


0.12 


104.86 


<0.00001 


83% 


912 


105 


Excluding studies W < 40 


12 


990 


0.31 


(0.18, 0.44) 


<0.00001 


0.04 


42.77 


<0.00001 


74% 


360 


70 


Subgroup Franco 


3 


103 


1.32 


(1.05, 1.59) 


<0.00001 


0.00 


0.92 


0.63 


0% 


393 


25 


Subgroup rest 


21 


1245 


0.29 


(0.19, 0.40) 


<0.00001 


0.03 


53.68 


<0. 0001 


63% 


588 


115 


Controlled effect 


19 


1897 


0.40 


(0.21, 0.58) 


<0.0001 


0.11 


59.35 


<0.00001 


70% 


722 


105 


Excluding estimated ES (#) 


16 


1445 


0.45 


(0.23, 0.68) 


<0.0001 


0.14 


54.83 


<0.00001 


73% 


704 


90 


Excluding studies n < 77 


3 


656 


0.31 


(0.15, 0.46) 


0.0001 


0.0 


0.10 


0.95 


0% 


90 


25 


Subgroup Franco 


3 


205 


1.34 


(1.04, 1.65) 


<0.00001 


0.00 


1.83 


0.40 


0% 


399 


25 


Subgroup rest 


16 


1692 


0.23 


(0.13, 0.33) 


<0.00001 


0.00 


11.05 


0.75 


0% 


352 


90 



K, number of studies; N/n, number of participants; g, weighted mean effect size; CI, confidence interval; z 2 , variance component; p, level of significance; Q, 
Q — Statistic. a k fs is the number of unavailable studies with null results, that would be required to reduce the overall result to an insignificant level. b lf k fs is 
exceeding the criterion (5k + 10), an essential influence of availability bias is unlikely. 
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FIGURE 3 | Funnel plot of within-group effect sizes (K = 24). The vertical bar represents the weighted (by sample sizes) mean effects size. 
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estimated ES (#) from synthesis, lead to slightly higher ES (g = 
0.49; 95% CI 0.31, 0.67) and more between study variance (r 2 = 
0.12). Synthesis only of studies with a minimum sample size of 
41 (K= 12) revealed an ES of.31 (95% CI 0.18, 0.44) and a 
tau-squared of 0.04. 

Controlled effects sizes 

Weighted mean effect size of the 19 studies using a controlled 
design was g = 0.40 (95% CI 0.21, 0.58), a small to medium 
effect. Again there was evidence for heterogeneity. The fail-safe 
N criterion is exceeded. The funnel plot follows a similar pat- 
tern of asymmetry as in pre-post effect sizes, which can be seen 
in Figure 4. On the other hand, the fail-safe number of 722 
exceeded clearly the criterion (105), indicating the robustness of 
results concerning availability bias. Sensitivity analyses excluding 
estimated ES (#) showed a similar ES (g = 0.44; 95% CI 0.23, 
0.68) and a larger between study variance (r 2 = 0.14). Synthesis 
only including studies with an adequate ES of n = 78 or higher 
(K = 3) yielded a lower ES (g = 0.31; 95% CI 0.15, 0.46) and no 
between study variance (r 2 = 0.00). 

Exploratory analyses 

Examining ES and plots, the three studies from the Franco lusto 
research group were categorized as one subgroup. In three inde- 
pendent studies, the effects of the Meditation Fluir program 
were explored. This very sophisticated, demanding, and well- 
established program for graduating high-school students clearly 
differentiates itself from other interventions by a very high inten- 
sity. A subgroup analysis was performed for within-group effect 
size and controlled effect size. Separate analysis leads to a slight 
reduction of heterogeneity in within-group effect sizes and to 



complete reduction of heterogeneity in controlled effect sizes (see 
Table 3). In both cases CI intervals do not overlap, and the per- 
centage of genuine subgroup differences is 98%. Differences of 
subgroup effects were significant for within-group effects sizes 
(X 2 = 50.21, p < 0.00001) and controlled effect sizes (x 2 = 
46.47, p < 0.00001). 

To investigate whether the intensity of mindfulness train- 
ing explains part of the heterogeneity between ES of all stud- 
ies reviewed, a random-effects meta-regression was performed. 
Minutes of mindfulness practice in total (including training ses- 
sions and home practice, if it was compulsory) were entered 
as a predictor and ES as the outcome variable. Studies were 
weighted by inverse variance, combining within-trial variance 
of treatment effect and the between study variance. As can be 
seen in Figures 5, 6, there is a substantial correlation between 
ES and minutes of mindfulness training for controlled ES, and a 
slightly weaker correlation for within group ES. Regression analy- 
sis shows that intensity of mindfulness practice accounts for 21% 
(adjusted R 2 = 0.21) of heterogeneity in within-group ES and 
52% (adjusted R 2 = 0.52) of heterogeneity in controlled ES (see 
also Table 4). The three studies with the highest intensity driving 
the strong correlations were those from the Spanish Franco Justo 
research group. 

Outcomes of quantitative synthesis for each domain are pre- 
sented in Table 5. Effect sizes in the domain of cognitive per- 
formance were moderate to high, whereas effect sizes of the 
stress and resilience domains showed small to moderate ES. 
The domain of emotional problems and third person ratings 
demonstrated small ES and CI's overlapping zero. High levels of 
heterogeneity could be identified in all domains except emotional 
problems. In the domain of emotional problems, heterogeneity 




Weighted Mean Effect Size (g) 



FIGURE 4 | Funnel plot of all controlled effects sizes [K = 19). The vertical bar represents the weighted (by sample sizes) mean effect sizes. 



www.frontiersin.org 



June 2014 | Volume 5 | Article 603 | 15 



Zenner et al. 



Mindfulness-based interventions in schools 



2 




5000 



FIGURE 5 | Bubble plot of the 24 within group effects sizes against 
Intensity of mindfulness Training and regression line. R 2 (adjusted) = 
0.21. 



2 




5000 



FIGURE 6 | Bubble plot of the 19 controlled effects sizes against 
Intensity of mindfulness training and regression line. R 2 (adjusted) = 
0.52. 



Table 4 | Results of random-effects meta-regression on intensity of 
mindfulness training for within-group and controlled effect sizes. 

Model B SE B Beta Sig. 

WITHIN-GROUP EFFECT SIZE 

1. (Constant)lntensity (Minjn) -1.121 0.583 0.068 

0.246 0.093 0.490 0.015 

CONTROLLED EFFECT SIZE 

1. (Constant)lntensity (Minjn) -1.910 0.512 0.002 

0.359 0.080 0.738 0.000 



was at a medium level and according to the Q-test, absence of het- 
erogeneity can be assumed. The fail-safe N criterion was exceeded 
considerably in all 5 domains. 

DISCUSSION 

This is the first systematic review and meta-analysis to summa- 
rize data available on the effects of mindfulness-based trainings 
for children and youths in a school setting. Twenty-four stud- 
ies were located that report a significant medium effect size of 
g = 0.40 across all controlled studies and domains. Remarkably, 
the ES of studies using pre-post designs only is very similar, 



with g = 0.41. The effects are strongest in the domain of cog- 
nitive performance with a large and significant ES of g = 0.80 
for controlled studies. Effect sizes are smaller but still signifi- 
cant in the domains of resilience measures (g = 0.36) and stress 
measures (g = 0.39), and they are small and not significant for 
measures of emotional problems (g = 0.19) and third-person 
ratings (g = 0.25). In the latter two domains pre-post ES are 
larger, while in all other domains they are either very similar to 
the controlled ES or even somewhat smaller. Thus, taken from 
a bird's eye view, mindfulness-based training in a school con- 
text has effects that are seen mostly in the cognitive domain, but 
also in psychological measures of stress, coping, and resilience. 
Acceptance seems to be high with few reported adverse events 
or incidents. There were some hints that implementation was 
not always without difficulties. It is important to keep in mind 
that the analysis referring to feasibility is very limited due to 
methodological issues. 

STRENGTHS 

We went to great lengths to locate all relevant studies and get more 
detailed information from authors. Since all but two authors 
complied with our requests, our work is novel and complete. A 
third of the material included in this review is unpublished gray 
literature. Hence, we are confident that availability bias was com- 
paratively small. Although the funnel plot seems to indicate such 
a bias, one should bear in mind that the asymmetry is mainly 
caused by three studies with large ES stemming from one group 
in Spain that have developed a very intense mindfulness train- 
ing. Excluding those studies from the visual analysis of the funnel 
plot renders it symmetrical, thus testifying to our success at locat- 
ing the most relevant studies. Also, the large fail-safe Ns show 
that the results are robust regarding availability bias. In most 
cases, more than twice the number of available studies would 
be needed to render the ES insignificant, a rather unrealistic 
assumption. 

We adopted conservative quantitative estimation methods. 
When SD and Means were unavailable, ES of measures were set 
to zero. We corrected for baseline differences by using difference- 
scores as the basis of ES estimation. By using correction factors 
for small studies, larger studies receive more weight, and by using 
random-effects models the large variation is taken into account. 
By analyzing studies both through overall ES and domain spe- 
cific ES, we tried to disentangle the maze of very diverse outcome 
measures employed in those studies. We took care to not inflate 
ES by only using one contribution per outcome measure to each 
study. Data were inspected carefully in terms of heterogeneity 
and biases and various sensitivity analyses were computed. By 
exploring the variation through meta-regression we were able to 
account for a sizeable portion of the variance through one theo- 
retically important variable, namely the amount of practice (i.e., 
the intensity) implemented in the study, which accounts for 52% 
of the variance in the controlled studies and 21% of the variance 
in pre-post-design studies. Given the heterogeneity of measures, 
students, settings, and programs, this is a remarkable finding that 
suggests that one of the most important factors for the variation 
across studies is the amount of practice that a mindfulness based 
program has introduced. 
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Table 5 | Domain specific effect sizes and statistics for within group and controlled effects sizes respectively. 



Domain 


Type of effect size 




Sample 




Effect size 


Heterogeneity 


K 


n 


Hedges's g 


95%-CI 


i 2 


Cognitive performane 


pre-post 


8 


327 


0.68 


(U.oo, I.Uo) 


88% 




Controlled 


7 


569 


0.80 


(U.ob, I. zb) 


82% 


Emotional problsms 


pre-post 


1 1 




U.o I 


in iq n a o\ 


A AO/ 
H-H /O 




Controlled 


9 


903 


0.19 


(-0.03, 0.41) 


52% 


Stress 


pre-post 


8 


374 


0.36 


(0.05, 0.66) 


85% 




Controlled 


7 


674 


0.39 


(0.07, 0.71) 


78% 


Factors of resilience 


pre-post 


17 


1082 


0.38 


(0.20, 0.55) 


86% 




Controlled 


13 


1497 


0.36 


(0.09, 0.62) 


82% 


Third person Ratings 


pre-post 


8 


448 


0.34 


(0.08, 0.60) 


84% 




Controlled 


6 


591 


0.25 


(-0.10, 0.61) 


74% 



K, number of studies; n, number of participants; g, weighted mean effect size; CI, confidence interval. 



LIMITATIONS 

This is simultaneously the major limitation of our findings: the 
heterogeneity of the studies is considerable, and hence the esti- 
mates of effect sizes, including their significance, can only have an 
orienting function. It is plausible that school-background, social 
background, and how a program is accepted within a particu- 
lar school context influence its effects, yet we do not have the 
information necessary to explore these effects or those of other 
potential moderators. For instance, it is a completely different sit- 
uation if pupils attend within the compulsory school framework 
or are willing to stay on in their free time, whether there is a class- 
room or workshop setting. Furthermore, it makes a difference 
if teachers themselves implement programs or if outside train- 
ers come and deliver the courses. Additionally, the instructors' 
qualifications and their personal experience with mindfulness are 
surely important. A lot of this information may be decisive, yet is 
not available in study reports. 

As is the case with any nascent field of research, the hetero- 
geneity is also built in through the exploratory framework of most 
studies. In only a few cases, such as with the Franco Justo research 
group, were studies conducted in replication. Mostly, researchers 
implemented their own programs. Therefore, a variety of pro- 
grams were evaluated or tested. Thus, there are no manualized 
consensus programs available, as is the case with MBSR or MBCT 
Also, outcome measures for children are much less stable, both 
psychometrically and age-wise. By default, a lot of tests available 
for children are only partially validated, or are sometimes used 
in age groups where no clear validation exists. Also, some of the 
measures might have exhibited floor or ceiling effects, especially 
when clinical measures are used for groups that are within nor- 
mal range. While the motivation of patients studied in clinical 
studies of MBSR and MBCT is comparatively easy to gauge, such 
a motivation is less clear for children. This source of variance was 
completely out of reach for us, as only one study documented 
motivation. 

Studies are often underpowered and small. This is not a sur- 
prise, given the exploratory nature of the field. It means, however, 
that the findings are tentative and need to be supported by larger, 
more robust evaluations in groups that are representative of 



settings where such trainings will likely be implemented. It also 
means that a large proportion of the effect size is derived from 
studies where the study size is small and hence the variation is 
large. Synthesis only including studies with an appropriate sample 
size revealed an ES of.31 for pre-post as well as controlled ES. The 
decrease in ES and heterogeneity indicates that our results might 
be slightly biased by the "small-study effect" (Sterne et al., 2000), 
which leads to an overestimation of ES. As a result, an overall ES 
of 0.31 is a more stable estimate. 

None of the studies used a strong active control. Hence the 
ES estimate is for an effect which has not been compared with 
another intervention or control. The precise role the element of 
mindfulness really plays is unknown, as is the extent of the effect 
that can be attributed to non-specific intervention factors, such as 
perceived group support, the specialty, and novelty of the inter- 
vention, of taking time out in school and at home, or of generic 
resting and relaxing. We only have one indirect indicator, and 
this is the strong correlation between ES and mindfulness training 
intensity revealed by the meta-regression. 

COMPARISON WITH OTHER FINDINGS 

This is the first analysis of its kind regarding school based MBIs, as 
far as we are aware. Meta-analyses have been carried out in other 
fields, such as the clinical effects of MBSR in adults (Grossman 
et al., 2004). This first analysis isolated an ES of approximately 
d = 0.5, for patients and non-patients, for physical and mental 
health measures alike. In a more recent meta-analysis by Eberth 
and Sedlmeier (2012) an ES of r = 0.31 was found for the effect 
of MBSR in non-clinical adult populations, based on a larger 
amount of studies (k = 17). Thus, effects of MBIs in non-clinical 
settings seem to be slightly higher in adults than in children and 
youth. 

However, the ES we derived in this analysis are in the same 
range as results of other meta-analyses of school-based pre- 
vention programs. A meta-analysis of school-based social and 
emotional learning programs, for example, revealed an overall 
ES of g = 0.30 and an 12 of 91% (Durlak et al, 2011). Also, 
the ES of 3 domains, namely emotional problems, resilience, 
and third person ratings, showed similar ES compared to 
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respective categories in larger meta-analyses of school-based pre- 
vention programs. However, effects on academic achievement 
were lower in other meta-analyses (Durlak et al, 2011; Sklad 
et al., 2012). ES of stress and coping measures were much 
higher (g = — 1.51) in studies targeting stress directly than in 
this study (Kraag et al., 2006). Levels of statistical heterogene- 
ity of the referred studies were about the same magnitude as in 
our study. 

SUGGESTIONS FOR FURTHER WORK 

It is obvious that more research, especially larger and randomized 
studies, if possible with active controls, is needed. Also, longer 
follow-up measures would be appropriate, primarily to see if 
benefits are lasting, but also to investigate potential effects of 
triggering developmental steps. Besides, attrition rates, includ- 
ing reasons for dropout, should be reported, because relevant 
information regarding implementation strategies, feasibility, and 
contraindication might be extracted. Great consideration must 
be given to outcome measures. As our analysis shows, the 
effects of mindfulness-based interventions can be rather dif- 
ferentiated across domains. A lot of the scales used are not 
really adequate. Researchers might want to pilot their measures 
before using them or employ measures that have been sensitive 
in other studies. Further, it would make sense not to exclu- 
sively rely on self-report data and questionnaires in general, 
but to triangulate measures with qualitative data and behav- 
ioral measures. Using qualitative approaches, new hypotheses 
could be generated and other adequate methods could be devel- 
oped. Manuals of the intervention studied should be made 
available. 

To prevent unnecessary failure in implementation, studies 
should use a mixed-methods approach to assess outcome and 
acceptability, adopting methods such as written teacher reports, 
review sessions, individual interviews, observations of training 
sessions and student questionnaires and interviews. For exam- 
ple, Greenberg et al. (2004) have described a number of criteria 
such as timing, dosage and quality of sessions, student absen- 
teeism and responsiveness, teacher experience, and commitment. 
It should be determined which aspects of the implementation 
process are most important, and what adaptations can be made 
without harming the integrity of the intervention. All this can 
only be investigated if adequate information is provided. This 
will allow future meta-analysts to assess sources of heterogeneity 
better than we were able to. 

What is also clear from our study is that implementing and 
studying mindfulness-based interventions in schools is a promis- 
ing avenue. Although not formally assessed, from our own expe- 
rience and in accordance with others (Roeser et al., 2012), we 
suggest a good model might be to train teachers in mindfulness. 
They could then promote mindfulness in their pupils through 
teaching mindfully, and through teaching mindfulness directly in 
diverse settings. For if mindfulness is to be established in a school- 
based framework it will have to be teachers who are the agents 
and ambassadors of change. This might be a good resource for 
teachers' own resilience and prevention of burnout, in addition 
to being, very likely, the best way of delivering mindfulness in 
schools. 



SUMMARY 

Our analysis suggests that mindfulness-based interventions for 
children and youths are able to increase cognitive capacity of 
attending and learning by nearly one standard deviation and yield 
an overall effect size of g = 0.40. The effect is stronger in stud- 
ies where more mindfulness training and home practice has been 
implemented. However, results might be slightly biased by the 
"small study effect." Furthermore, the heterogeneity is large and 
thus further work, especially locating the origin of the heterogene- 
ity, is needed. We suggest that larger studies using robust and well 
validated measures be conducted, and that active controls should 
be considered. The available evidence certainly justifies allocating 
resources to such implementations and evaluations, since MBIs 
carry the promise of improving learning skills and resilience. 
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